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Abstract 

The inductive size bias coupling technique and Stein's method yield a Berry-Esseen 
theorem for the number of urns having occupancy d > 2 when n balls are uniformly 
distributed over m urns. In particular, there exists a constant C depending only on d 
such that 



sup|P(T^„, 



<z)- P{Z < z)| < C 



0"n 



for all n > d and m > 2, 



where Wn^m and o"^ „ are the standardized count and variance, respectively, of the 
number of urns with d balls, and Z is a standard normal random variable. Asymptot- 
ically, the bound is optimal up to constants if n and m tend to infinity together in a 
way such that n/m stays bounded. 

1 Introduction 

In this paper we provide a Berry-Esseen theorem in the classical occupancy problem for 
the normal approximation of the distribution of the number of urns having occupancy d 
when n balls are uniformly distributed among m urns. Our pr oof relies on the inductive 
version of Stein's method using size bias coupli ngs as presented inlGoldsteinI (120121 ). In turn, 
that work springs from the use of induction in iBolthausenI ( 1984 ). achieving bounds for the 
combinatorial central limit theorem. The inductive method relies on expressing a bound for 
the distance of the given variable to the normal in terms of smaller versions of the same 
problem. For instance, in the occupancy model, conditional on the contents of a randomly 
chosen urn, the distribution of the remaining balls is uniformly distributed over one fewer 
urn. 

Stein's method often proceeds by coupling a random variable Y of interest to a related 
one using, for examp l e, the metho d of ex chan geable pairs, size b ias couplings, or zero bias 



one usmg, tor examp l e, tne metno d oi ex cnan geabie pairs, size b ias couplings, or zero bias 
couplings (see ISteinI fll972[ ). ISteinl fll986[ ) and IChen et"aLl fl2010[ )l However, some of the 
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couplings that are the simplest to construct may lack a key boundedness property that is 
required for the application of many results. By applying a theorem that does no t require 
the co upling to be bounded, in Theorem 11.11 we a re able to exte nd the work of lEnglund 
(jl98lh on the number of empty urns, and that of iPenrosd (120091 ) on the number of urns 
occupied by a single ball, to the case of all occupancies of size two and greater. 

In the general multinomial occupancy model, one considers a vector M„ having compo- 
nents Mn{i) that record the number of balls falling in urn i in n independent trials, where 
in each trial a single ball falls in urn i with probability 6i for alH > 1. In particular, the 
(multinomial) distribution Ai{n, 9) of M„ is given by 



P{Mn{i) = mi,i>l) 



n ^r- 



when rrii, i > 1, are nonnegative integers summing to n, and ^ G where 



e 



■ Oi>0,i>l, 



i>l 



(1) 



For all (i > the number Y'i'^^ of urns containing d balls is therefore given by 

y^'^ = T.^n} where X^fj = m^t) = d). 



(2) 



i>l 



Among the many applica tions of mu l tinorn i al occupancy rn ode ls are thewell-known 
species trapping problem (see IChao et al.l (119961 ). iRobbinj (Il968[). or lStarr (119791)1 and the 



closely -related problem of statistical linguistics (see lEfron and Thistedl (119761 ) and lThisted and Efron 
(119871 )). In these applications a collection of species are trapped, or a collection of words are 
observed, according to the multinomial distribution Ai{n,6), and estimators of parameters 
related to the number of unseen species, or words known but unused by an author, are of 
central interest. Estimators of, say, the number of unknown species most often take the 
form of linear combinations of Yri'^^ for various d. For example, a well-known conjecture of 



StarrI (119791 ) is that the uniformly minimum variance unbiased estimator, or UMVUE, of the 
probability of sampling a new species in a sample of size n — riQ, based on an original sample 
of size no, is 



n—UQ /n— ng — 1\ 

E V d-1 ) y(d) 
/n\ n ■ 

d=l 



For occupancy models where n balls are distributed among the first m urns only, the 
urn probability vector 9 is given by (6*1, . . . , 9m, 0, 0, . . .) G 6 as in ([T]). Below we will find 
it convenient to continue to consider the case where the urns are indexed by alH > 1, even 
though all but the first m of them will be empty. 

In what follows we fix c? > and drop the superscript {d) from our notation, denoting 
X^fl and Yn'^^ simply as Xn,i and y„, respectively. Kolchin et al. ( 1978 . p. 37) show that 
the mean fin,m and variance cx^^ of the number Yn of urns occupied by d G {0, 1, . . .} balls. 
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when n balls are distributed uniformly over m urns, are given by 

n—d 



a J mr \ m 

= l^n,m + m{m - 1 



and 



n 



d,d,n — 2dj m?'^ 



1 - 



m 



n~2d 



2 



(3) 
(4) 



for aA\ n > d and m > 2, with the second term in 
m > 2. 



set to zero for aA\ d < n < 2d and 



The cases d = and d = 1 having already been handled by lEnglundl (119811 ) and [Penrose 



( I2OO9I ) respectively, we focus on d > 2. Our main result is the following. 

Theorem 1.1 For d G {2, 3, . . . , }, let Yn be the number of urns containing d balls in the 
uniform occupancy model with n balls and m urns. Then, with fin,m given by cr'^ ^ by 
and 



1 + [n/my^ 

there exists a constant C depending only on d such that the standardized count 



(5) 



satisfies 



sup \ P{Wn,m < -z) — P{Z < z)\ < C/vn^m for all u > d and m > 2. 



Regarding lower bounds, lEnglundl ( llQSll . Section 6) shows that in the case d = 0, 
sup \P{Wn,m <z)-P{Z<z)\> 0.087/ max(3, (T,,^) 



(6) 



(7) 



and we remark that Englund's argument holds without changes for any random variable 
Wn^m with finite variance supported on the integers, and so for the d > cases of the 
occupancy problem in particular. 

Although Theorem 1 1 . 1 1 yields a bound for all n > c? and m > 2, often interest centers on 
the behavior of a sequence of occupancy models where n and m vary together in such a way 
that the ratio of n to m is bounded away from zero and infinity, that is, when there exist 
< a < 6 < oo such that 



n 

a< — <b. 

m 



(8) 



Note that the bound on the supremum norm in ([6]) achieves the rate l/(T„ m; optimal in view 
of ([7]), when an^mlfn,m is bounded away from infinity, or equivalently, when the upper bound 
in (IHl) holds. These observations yield the following immediate corollary; see also Section H] 
for a more detailed discussion of these and other asymptotic regimes. 
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Corollary 1.1 For any b G (0, oo) there exists a constant C, depending only on d & 
{2, 3, . . .} and b, such that 



sup \P{Wn,m <Z)-P{Z<Z)\< C / Cn,. 



for all n > d and m > 2 that satisfy n/m < b, and the bound is optimal up to constants. 



Specializing the broad results of Theorem 4.2 of IChen and RollinI ( 120101 ) for general functions 
of urn occupancies to the case considered here under ([8]) results in a bound in Kolmogorov 
distance in the central domain such as the one here, with explicit constants but additional 
factors of log n to various powers. 

To begin to describe the first ingredient required for the proof of Theorem II. ![ the 
construction of a size biased coupling, recall that for a nonnegative random variable Y with 
finite, nonzero mean n, has the F-size bias distribution if 



E[Yf{Y)] = fiEf{Y^ 



(9) 



for functions / for which the expectat i ons above exist. In eniployin g the size bias ve r sion o f 
Stein's method, see lSaldi et "aP f ll989h . [Goldstein and RinottI fll996[ ) and lChen et~aD fcoioh . 
the goal is to construct, on the same space as F, a variable Y^ with the F-size bias dis- 
tribution, such that Y and Y'^ are close is some sense. Previous applications of size bias 
coupling in Stein's met hod for producin g bounds in the Kolmogorov distance in the presence 
of dependence, but for iGoldsteinI (120121 ). have required that — F| be bounded. 

To size bias the number of urns y„ containing d balls, note that when n balls are uniformly 
distributed over m urns, Yn in ([2]) is the sum of m exchangeable indicators. In general. Lemma 
12. II below says, essentially, that to size bias such a sum one chooses an indicator uniformly, 
sets it to one, and then 'adjusts' the remaining indicators, if necessary, to have their original 
distribution given that the selected indicator now takes the value one. In the occupancy 
problem, to set an indicator for a chosen urn equal to one if it is not so already, one must 
either add balls to that urn if it has fewer than d balls, or redistribute balls from the urn if it 
has an excess over d. As it is possible that the chosen urn has, say, all n balls, the resulting 
coupling fails to be bounded in n. However, as there is small probability that a very large 
number of balls will need to be redistributed, the coupling can be controlled using quantities 
such as moments on bounds Kn on the absolute difference between Y^ and F„. 

To describe the second ingredient in the application of Theorem 13.11 in general, the 
inductive component, suppose that for some nonnegative integer ni, for all n > ni we are 
given a nonnegative random variable Yn whose distribution Cg depends on a parameter ^ in a 
topological space G„. As bounds to the normal for Yn can be expressed in terms of a number 
of quantities, including bounds to the normal for 'smaller versions' of the same problem, an 
inductive argument yielding a recursion for the bound may be constructed when for a certain 
collection of random variables J„, L„ a random variable taking values in {0, . . . ,n} and ijjn,e 
a parameter in Qn-Ln, there exists a random variable Vn on the same space as Yn such that 

Ce{Vn\Jn) = C^^g{Yn-L^) 

holds on a set where the size of L„ is controlled. One must also control the difference 
between Yn and Vn, but again strict boundedness is not required ow Yn — Vn but rather 
moment estimates of a bounding random variable Bn satisfying \Yn — Vn\ < Bn- 
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Regarding the inductive component for our occupancy problem, if the urn chosen to 
have occupancy d in the size bias configuration is removed, then, conditional on the identity 
of that urn and the number of balls it contains, the remaining configuration has the same 
uniform multinomial distribution over the remaining urns, one fewer than the number in 
the original configuration, of the balls not contained in the urn chosen. And again, as with 
the bound Kn on — though it is possible that the chosen urn contains a very large 
number of balls, it is unlik ely that it will. 

In the uniform model, lEnglundl (jl98ll ) gave an explicit Berry-Esseen bound of order 
l/c"n,m, with a corresponding lower bound (JTj) of the same order, for the number of occupied 
urns, or equivalently, for the number of empty urns forming the complement, t hat is, those 
with occupancy d = 0. For the non-uniform ca se, iQuine and RobinsonI (119841 ) gave a less 
explicit error bound. iHwang and JansonI (120081 ) obtained a local limit theorem, and also 
descri be ap plications inclu d ing sp ecies trapping and statistical linguistics. IJohnson and Kotz 
( 19771 ) and iKolchin et al.l ( 19781) g i ve res ults for models of this type in the uniform and 



some non-uniform cases. iPenrosd (120091 ) considers the case d = 1 where Yn counts the 



number of isolated balls, and obtains a Berry-Esseen bound via size-biased coupling in the 
unifor n i case, and fo r the n on-u niform case as well with a slightly larger constant. iKarlin 
(119671 ) . iGnedin et al.l (120071 ) and lBarbour and GnedinI (120091 ) consider the infinite occupancy 
model, the first two proving central limit theorems for the number of occupied urns, the last 
providing a multivariate normal approximation for arbitrary occupancies of a fixed number 
of urns. 

In Section [2] we construct the coupling of Yn and the size biased variable Y^. In Section [3l 
with the help of Lemma 13. ![ we prove Theorem 11.11 by verifying the conditions of Theorem 
13.11 Some discussion is provided in Section |U and the proof of Lemma 13.11 is given in 
the Appendix. With Z the set of integers, let = Z fl [k,oo). Throughout, we will 
use C,Ci,C2, . . . to denote positive, finite constants depending only on d. Since in what 
follows we focus on the uniform occupancy problem, for notational simplicity we specify the 
multinomial probability vector by m G Ni rather than by the corresponding vector 



(l/m,...,l/m,0,0,...) 



(10) 



m 



and write N2 for our parameter space. When considering subsets On ^ © for some n G N 
and invoking Theorem 13. H statements such as m G On should be interpreted as meaning 
that 6m G 0„. Further, we will denote the uniform multinomial distribution of n balls over 
m urns as Ai{n,m), in parallel to our notation for the binomial B{n,p) distribution with n 
trials and success probability p. 



2 Size Bias Coupling 



A gen eral prescription for size biasing a sum of nonnegative variables is given in [Goldstein and Rinott 
( 1l996l ): specializing to exchangeable indicators yields the following result. 

Lemma 2.1 Suppose Y = Xloex"^"' ^/^^^^^^ ■^'W'Ti of nontrivial exchangeable Bernoulli vari- 
ables {Xa, a G Z}, and that for a E Z the variables {X^, /3 G X} have joint distribution 

£(X", /3 G X) = C{Xfs, 13 G X|X, = 1). 
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Then 

has the Y size biased distribution characterized by as does the mixture when I is 
a random index with values in X, independent of all other variables. 

Proof: First, fixing a G X, we show that satisfies ([9]). For given /, 
E[Yf{Y)] = Y,E[Xpfiy)] = E^l^/^ = = 

As exchangeabihty imphes that E[f{Y)\Xi3 = 1] does not depend on /3, we have 

E[Yf{Y)] = = 1] ) E[fiY)\X^ = 1] = E[Y]E[fiY^)], 

\l3ex J 

demonstrating the first result. The second follows easily using that Y^ is a mixture of random 
variables all of which have distribution V*. □ 

With n > d we prove Theorem 11.11 by constructing a size bias coupling of Y^ to Yn 
for the urn model and verifying the hypotheses of Theorem 13.11 To apply Lemma 12.11 we 
construct, for each i G {1, . . . , m}, a configuration that has the conditional distribution 
of M (n, m) given that urn i contains d balls on the same space as a configuration M„ with 
the unconditional distribution Ai{n,m). 

We now describe the joint construction of and M„ formally; in its course we will 
also define the vector specifying the difference, up to sign, between M„ and M^. For a 
vector M and a given i > 1, let (M)j be the vector obtained by deleting the i*^ component 
of M. 

With i G {1, . . . , m} we first specify the i^^ components of M„ and by letting Mn{i) ~ 
B{n,l/m) and M^{i) = d, respectively. Next, let vectors M^- and satisfy M'^^{i) = 
Rn{i) = 0, and whose remaining components are conditionally independent given M„(^), 
with conditional distributions given Mn{i) specified by 

£((M:^_,),|M„(z)) =M{n- M^{i) V m - 1) 

and 

£((R^),|M„(^)) = M{\d - M„(2)| ,m - 1), (11) 

and set 

(M„), = (M;,), + l(M„(^) < d) (R^). and (M^), = (M^,), + 1(M„(^) > d) (R^),. 

By the additive property of the multinomial distribution, conditional on Mn{i) we have that 
{M.n)i ^ Ai{n — Mn{i),m — 1) in all cases, so that M,„ ~ Ai{n, m), as required. Likewise in 
all cases (M^)j Ai{n — d, m — 1) , so 

M„ ~ M{n, m) and £(M^) = i:(M„|M„(z) = d). (12) 

Further, we note that that the difference between the two configurations excluding urn i 
satisfies 

(M^), - (M„), = sign(M„(0 > d){KJ,, where J] B^^ij) = \d - M„(0|. (13) 
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Applying the indicator function 1(- = d) coordinate-wise to f[T2|) and recalling ([2]) we 
obtain 

and Lemma [2.11 now yields that Y^, counting the number of urns containing d balls in the 
configuration M^, given explicitly by 

Y: = J2Kj, with X;^. = l(M^(j) = rf) forj>l, 

has the y„-size biased distribution. Again by Lemma 12.11 if J„ is uniformly distributed 
over {1, . . . , m}, independent of all other variables, then = Y^" also has the F„-size bias 
distribution. 



3 Auxiliary Results and Proof of Theorem 1.1 



To prove Theorem 11.11 we utilize a general result of IGoldsteinl toij ). given as Theorem 13.11 



below, whose framework has already been described in Section [H In particular, the random 
variables of interest Yn,n > uq have distributions CeiYn) that depend on a parameter 6' in a 
topological space 6„, also endowed with a a- algebra of subsets. In our application we give 
©n = the discrete topology, and the cr-algebra the collection of all its subsets. 

In Theorem 13. 11 r^ g is a function that determines the quality of the bound to the normal, 
the sequence Sn,e is used to control a random variable L„ e determining the size of the smaller 
subproblem Vn related to F„. In general, the mean fin,0 and variance a'^g of Yn under Cq, 
and rn,8, are required to be measurable in 6, a condition satisfied for all natural examples, 
and in particular, for the one considered here. 

Theorem 3.1 For some uq G Nq and all n > no and 9 G Qn, let Yn be a nonnegative 
random variable with mean fin,0 and positive variance cr^g, and set 

Wn.e = (14) 

the standardized value of Yn- Let r„^e be positive for all n > Uq and 9 G ©n, and for all 
r > 0, letting 

Qn,r = {9eQn: Tnfi > r}, (15) 

assume there exists ri > and ui > uq such that 

max sup r„9 < oo. (16) 

no<n<ni 6lge„,ri 

Further, suppose that for all n > Ui and 9 G ©n.n? there exist random variables Y^, Kn, Ln, 
"^71,9, Vn one? Bn on the same space as Yn, and a a-algebra Tn, generated by a collection of 
random elements Jn, such that the following conditions hold. 
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1. The random variable has the Yn-size bias distribution, and 

'^nfi = V Vare {Ee{Y^ - F„|F„)) satisfies sup ^"'^^"^^^"'^ < oo. (17) 

2. The random variable Kn is J-'n-measurable, \Y^ — Yn\ < Kn, and 

sup — ■ ■ 3 ■ < OO, (18) 

n>ni,eee„,rj^ '^n,e 

with Wnfi as given in 

3. The random variable L„ takes values in {0,1,..., n}, there exists a positive integer 
valued sequence {snfi}n>ni satisfying n — > ^o, the variables and i/jn,e are 
J^n-measurable, with Fn,e e J^n,Fn,e C {L„ < Sn,e}, 

i^n,e ^ Qn-Ln and CeiVnlJn) = 'C^^^giYn-Lj on Fn,e (19) 

and 

2 

sup "j^^Ee [Klil - 1(F„,,))] < oo. (20) 



4- There exist {Ci,C2} C (0, oo) such that 
5. The random variable is J^n-i^Gasurable, \Yn — Vn\ < Bn and 



c^le <Ci(rl_r^^„ and r„,e < C2r„„i„,^„ « on 



rlgfinfiEelKlB,,] 
sup — ■ ^ < oo. (21) 

6. Either 

(a) there exists Infi G No such that Pe{Ln = /„,o) = 1 for all 9 G Qn,ri 
or 

(b) the set Qn,Ti is a compact subset ofQn, and the functions of 9 



tnfi 



Ee (^-^1{L^ = 1)^ fori e {0,1,..., n} 



are continuous on Qn,ri for / G {0, 1, . . . , s„} where s„ = sup^gQ^ Sn,e- 
Then there exists a constant C such that for all n > uq and 9 E Qn 
sup \Pe{Wn.e < z) - P{Z < z)\ < C/r^.e- 
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When higher moments exist a number of the conditions of the theorem may be verified 
using standard inequahties. In particular, by the Cauchy-Schwarz inequahty a sufficient 
condition for ffTSl) is 



1/2 



sup "3" "'"'"^ < 00 where kn,9,m = EeK^^, (22) 

and, when F„ g = {L^ g < s„.e} then a sufficient condition for fl20|) is 

1 1 

2 / 2 / 2 

sup 3 < 00 where ln,9,m = EgL^ , (23) 

n>ni,9G&„.ri '^n,9^'n,9 



since, additionally using the Markov inequality yields 



1.2 12 



^n,9 

Similarly, a sufficient condition for ( |2T1) is 

2 ~ ~ 

'^n,6l/^n,e^^,6l,4^n,e,2 ^ I. h TP um 

sup 1 < Qo where On^g.m = EqB^. 

Recalling = Zfl [A;, oo), applying Theorem 13. II to the occupancy problem we let no = 
fn,m be given by ([5]), and 9„ = N2 for all n > no, making note of the identification between 
positive integers m and elements given by (1101) that lie in the set 6 of ([T]). 

Before starting the proof of Theorem 11.11 we collect some crucial facts needed later re- 
garding the behavior of the mean and variance of Yn. Letting 

g ( ^ 

^rf(^) = and ipd{x) = 1- Td{x) - Td{x) , (24) 

al X 



Kolchin et al.l ( 119781 . p. 37-38) show that, for all n,m > 1 and d > 0, 

^^n,m < mTd{n/m)e'/'^ (25) 
and that for n, m — )• 00 such that n/m = o(m), 

/in,m = mTdin/m) + Td{n/m) (^d - ^ - + 0(l/m) 

and 

^n,m =mTd{n/m)<^d{n/m){l + o{l)). (26) 
The following lemma gives further properties of fin,m and a.^ ^, and is proved in the Appendix. 
Lemma 3.1 1. For any fixed n > d > 2, 

hm = (27) 

and the set Qn,ri, given in l[TS\) with rn.m o,s in is finite for all ri > 0. 
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2. Let d > 1. There are constants C3, C4, C5, depending only on d, such that, for alln > d 
and m >2, 

(b) < CiU and a^ ^^ < C^n. 

3. Let d>2. With (fdix) given by 'mf^yo(pd{x) > 0. 

4- Let d > 2. Given any n*,m* and e > there are constants ri and Cq such that all 
n, m satisfying <Jnm — ^i ^^^'^ satisfy 

(a) n > n* and m > m* 

(b) n/m < (1 + e) logm 

In keeping with the notation of Theorem 13.11 and the identification between elements in 
Ni and 9 in ([T]) and as described at the end of Section [T], in the following we will use Em, 
Varm, and Pm to respectively denote expectation, variance, and probability with respect to 
a multinomial distribution with probability parameter (fTOi) . 

Proof of Theorem 1.1 

We prove Theorem II. II by verifying the conditions of Theorem 13. II When n = d and m > 2, 
the probability that all d balls fall in urn 1 is positive, as is the probability that d — 1 balls 
fall in urn 1. Hence P{Yn > 0) and P{Yn = 0) are both positive, so y„ is not constant almost 
surely, and its variance cr^^ is strictly positive. The same conclusion holds for n > + 1 
and m > 2 by considering the event that d balls fall in urn 1 and n — d in urn 2, and the 
event that all balls fall in urn 1. Hence Vn^m given in ([5]) is also positive for all n > uq and 
m G 6n- In lieu of naming ui and ri explicitly, we show that the conditions of Theorem 13.11 
are satisfied by choosing ui and ri sufficiently large. By Part [1] of Lemma [3.11 the set On,ri 
is finite for all n > and ri > 0, hence f|T6l) is satisfied for any such pair. 

To help with the verification of the six conditions of Theorem 13.11 we first note that 
Parts [2] and m of Lemma [3TT] allow us to choose ri > such that there exist positive constants 
(73,6*5, Cq such that cr^ > Vi implies 

(Tn,m < C-illn,m (28) 

aim < C,n (29) 
^^n,m < Ceal m (30) 

Tl I 

— <21ogm<vW3 (31) 
m 

m > 3. (32) 
Below we will repeatedly use these bounds along with the fact that 

9„^ri C {m : (T^_„ > Tl} for all n > and m > 2, (33) 
which follows from directly from f|T5|) and the fact that r„ ,„ < On^m- 
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Verification of Condition [T] 

We verify inequality ( ITTl) in Condition [T] of Tlieorem 13. II by sliowing tliat, for constructed 
as in Section Ej tliere is a constant C7 and integer rii E Ni such that for all n > rii and 
'm G 6n,ri, the quantity '^n,m. satisfies 

< c,i±M!!!)!. (34) 

\ n 



Inequality (IMl) implies f lTTl) as 

'^n,mf^n,m^ n,rr 



2 — : ^u' ^ li^rti — : ; / — — : / V ^5; 

where we have used fl30|) and fl29l) . Hence we turn our attention to showing ([3 
By conditional Jensen's inequality, as is a function of M„, 

Yaim{E^{Y: - F„|F„)) < Yai^{E^{Y: - r„|M„)). 

Recalling that J„ is chosen uniformly from {1, . . . iTo], independently of the configuration 
M„, and that > 1 is the indicator that urn j contains exactly d balls in the size biased 

configuration, we have that 

Averaging over /„, we obtain 

m 

E„K-F„|M„] = -Vl(M„(0^rf) + - V PUM':,{j) = d\MMMn{j) ^ d) 

m ^-^ m ^-^ 

-- V P„(M:(j) ^ d|M„)l(M„(j) = ci). (35) 
m ^-^ 

To understand the first sum, note that since urn always contains d balls in the size biased 
configuration, X.^^^^^ - X„,/^ = 1 - l(M„(/n) = d)) = l(M„(/„) ^ d)), so averaging over /„, 
which takes the values 1, . . . ,m each with probability 1/m, yields the first term. The next 
two terms arise from the fact that X^"j — X^j G { — 1,0, 1}; in particular, the second term 
accounts for the cases when this difference is 1, and the third term for when it is —1. For 
the second sum, when J„ = i we have X^^j — Xnj = 1 for j 7^ i if and only if X^^j = 1 
and Xnj = 0, that is, if and only if M^{j) = d and Mn{j) 7^ d. Likewise, for the third 
sum, X^^j — Xnj = — 1 for j 7^ i if and only if X^^j = and Xnj = 1, and so if and only if 
Mi,{j) ^ d and MnU) = d. 

To obtain a bound on the variance of Em[Y^ — y„|M„] we apply the inequality 

(fc \ k 

I < A;^Var(Ai) (36) 
i=l / 1=1 

in order to handle the terms arising from f l35|) separately. We will use fl36l) and {Yli=i ^i?' — 
k Yli=i for any Ci, . . . , c^, repeatedly below without further mention. The factor of 1/m 
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outside each sum in ( 135|) contributes a factor of to the variance, which is withheld until 
further notice below. 

To bound the variance of the first sum, we note that 



Var„ Yl T^d)] = Var, 



m 



j=i 



l(M„(i) = d)\ = Vaimim-Yn) = Varm(F„) 
i=i J 

= al^<C,n (37) 

by m- 

For considering the calculation of the variance for the next sum, as Mn{j) = M^{j) when 
Mn{i) = d hj f|T3l) . we have 

P^(M:(j) = d\Mn)l{M4j) ^d) = Pm{Mi{j) = d\Mn)l{Mn{i) ^ d,M^{j) + d), 

and therefore may write 

Pm(M;(j) = t^|M„)l(M„(j)^rf)= Yl (^n,Uhj)+ Y bn,Uhj) 

where for i ^ j we set 

an,mihj) = PmiMi,ij) = d\MMM^it)>d,M^{j)y^d), (38) 
bn,U^,3) = PmiMM) = ci|M„)l(M„(2) < rf,M„(j) ^ d). 
For considering the third sum in fl5^ . we also define 

Cn,Uhj) = PmmU) + rf|M„)l(M„(j) = d). (39) 



In Lemma [3T2| following the proof of this theorem, it is shown that there are constants Cs, Cg 
and Cio such that 



Var^ 



Var^; 



Y2i ^n,m{hj) j < CsTl 1 + 



Y bn,m{i,j) ) < Cg — 
•,l<i,j<m,i^j 



Var^ 



for all n > rii and m G On,ri- Combining ( 137|) and ( H0|) -f l42|) . and accounting for the 1/m 



(40) 
(41) 
(42) 



factors in fl35|) . we have 



< — Cn <! n + n 



< C 



1 + 



m 

+ — +n 
n 



1 + 



n 



m 



12" 



C 



m 
1 



1 + 



n 



+ 



+ 



n 



m 



12" 



n 



+ 1 + 



4 / n \ 6 



<3c,.i|i+(ii)n 

n Vm/ J 



12 



where in the last step and below we use the elementary bound 
l + x^'+x^^ ...+x^^ <{j + l{ij <e}){l + x^) forall x > 0, 1 < ii < . . . < < i. (43) 



Then taking Cj = \^3Ci2 yields 



y/l + (n/m)6 1 + {n/mf 

^n,m S <^7 1= S tv?- 



n \ n 



Verification of Condition [2j Kn and its moments 

Let Jn = {In, Mn{In)}, the Set containing the identity J„ of the selected urn and the number 
Mn{In) of balls it contains, and recall that J^n is the a-algebra generated by J„. For D ~ 
B{n,p) and g G Ni we have 

ED" = J2 SjMjP^ < Yl ^^'"""'P^ ^ ^i3,.(np + {npY) < C^M + M"), (44) 



where in the first equality, due to iRiordanI ( 1l937l ). Sj^q are the Stirling numbers of the second 



kind and {n)j is the falling factorial, and in the second inequality Ci^^g = qmaxi<j<g Sj^g. 
Clearly 

Kr, = l + \d-MMn)\ 

is J-'„-measurable, being a function of Mn{In)- Recalling f[T^ from the construction in 
Section [21 accounting for urn J„ we see that the occupancy of at most Kn urns are different 
in the configurations and M„ for any i. In particular, — F„| < K^. 

By the triangle inequality Kn < {I + d) + Mn{In), and taking q^^ power, by a standard 
inequality and (jHj) we obtain 

EmK^n < '2"-' ((1 + dy + En^Mniln)') < 2"-' ((1 + d)" + Cu,, (1 + {n/mY)) 

< Ci5,g (ci" + 1 + in/my) < Ci6,g (1 + (n/my) . (45) 



We now show that (l22ll . sufficient for ( iTSll . is satisfied. Applying the definition (jSj) of 
rn,m, f pO]) - f l33|) . and the moment bound f HSj) . there is some rii such that for all n > rii and 
any m e Gn^^^ , 

rn,m/in,mfcy^,4 ^ ^ [Ci6,2(l + (n/m)^)]^/' 1 + (^M)' ^ ^ 

c^n,m 1 + (ra/m)'^ ^ \l + [n/mYj 

using (H3|) in this last step. 

Verification of Condition [3t L„ and its moments 

Set 

Ln = Mn{In)y '^■n,m = TTL — 1, Sn,m ~ ] and -Fn,m = {Ln ^ Sn^m}- 

Clearly L„ takes values in {0, 1, . . . ,n}, and n — Sn,m > "^o for all sufficiently large, and 
Ln,4'n,m and are measurable. Now by fl33|) and fl32|) the first part of f|T9l) holds. 
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Let 

Vn = Y,^n,i (46) 

with Xn,i as in (E]). Using that /„ is independent of Mn{j),j = 1, . . . ,m, and the properties 
of the multinomial M {n, m) distribution, we have 

C{Mn{j),J ^ In\Mn{In) = IJn = i) = C{Mn{j),j ^ m|M„(m) = /) = M{n - I, l/(m - 1)), 

and counting the number of urns with occupancy d on both sides of this distributional 
identity yields 

SO the second part of f|T9l) holds on the entire probability space, so in particular on F„ As 
Ln ~ B{n, l/m) under P^, from we obtain 

^mi^^<Ci4,,(l + (n/m)'?). (47) 

Hence, inequality f l23|) . sufficient for fl20|) . holds as 

1 1 

a/1 + (n/m)4^1 + {n/mY (yn,ra\/^ + {n/niY 
— ^17 l=rr^ ; — ; — 7770 S (-^18" 



cTn,mV^ [1 + {n/mff - [1 + {n/mf] 



1 + {n/mY 
1 + (n/m)^ 



where we have used the definition ([5]) of r„^m, the definition of Sn,m, (H5ll and ( H71) in the first 
inequality, (l30l) in the second inequality, ( l29l) in the third inequality, and (H3il for the last. 



Verification of Condition [4] 

We first show that there exists rii such that, for all n > rii and m G 0n,ru 

/^n,m < 18/in-L„,m-l OH -^n,m- (48) 

As n/{n — [a/^) — 1 as n — )■ oo and -Fn,m = {-^n ^ fv^}) there exists rii such that 
n — \y/ri \ > tiq and 



< < 2 on F„ ^, 



(n - L„)d \n - \^/n 

for all n> rii. 

Next, as m > 3 by ( l32l) we obtain — 2m > and therefore, using the first upper 

bound on n/m in (!3T|) for the second to last inequality, we obtain 

V "1^ _ / ]^ _j j < f 1 -I I < e^n/m^ < g61ogm/m ^ 



/i _ 1 ^" V ^ — 2m 

V m-1/ 
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Hence, for all n > n2 and m G 0n,ri, on recalling ([3]) we have 

n—d 



< 



< 2- 



n)d 



< li 



< 1 



(n - Lr,)d 1 



n—d 



1 

m 



n~d 



d\ (m - V m - 1 



— L 



n)d 



1 



n—d 



n—Ln—d 



d\ (m — l)'' ^ \ m — 1 

= 18/i„_L„^m-l- 

By ( l33i) . ( l30l) and ( l28l) hold whenever m G 6n,ri- Now the first part of Condition H] follows 
from ( l48i) . ( l30l) . and ( l28l) since, for all n > rii and m G 0n,ri, 

< C3Hn,m < 18C3/i„_L„,„__i < Cid^.^^ on where Ci = ISCeCs. (49) 

Since for m > 2, 



1 + 



=1+1— ) <l + 8( — ) < 

m — 1/ Vm/ \m — 1/ \m/ 



1 + 1^)' 

m/ 



and now the second part of Condition H] follows with the help of ( H9|) since 



1 + (-) 



1+ (^)^ 



m—1 



Verification of Condition [H and its moments 

With Vn given by (H6l) . we have |F„ — = X„^/„ < 1, so we take Bn = 1, which is trivially 
J-'„-measurable. Now using (l30ll . (H3l) . and (H5ll we obtain 



'^n.e/^n.e-^ml-^n^ri] _ rl elln,ekn,ni,2 Hn,m (1 + {n/mY) 

3 — 5 S (-^16,2—5 — — — — — ; — ^ S '^tvi6,2(-^fi- 



a. 



nfi 



Verification of Condition [6] 

Endowing the set N2 of integers with the discrete topology, a subset of 0„,ri C N2 is compact 
if and only if it is finite. As any function on a set with the discrete topology is continuous. 
Condition [Hb is a consequence of Lemma 13.1^ Part [H □ 

Next we state and prove a lemma used in the verification of Condition [H 

Lemma 3.2 Let d G {2,3,...}. There exists ni > no and constants Cs,Cg,Cio depending 
only on d such that fH0|) -fj42 |) hold for all n > ni and m G Gn.n- 
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Proof: Consider first ( HOj) . By (|T3|) . for all 1 < i, j < m,i ^ j, on Mn{i) > d we have 

m;(j) = m„(j) + K(j), 

so that RliiJ) is the number of the 'excess' M„(z) — balls distributed to urn j, which requires 
d — Mn{j) of them to achieve M\{j) = d. Thus an^mihj) = unless Mnii) — d > d — Mn{j), 
that is, unless M„(i) + M„(j) > 2c?. Hence, from fl38D . 

anAhj) = PmiKU) = rf|M„)l(M„(z) > d, M„(j) 7^ d) 

= Pm{Mn{j) + Kij) = rf|M„)l(M„(^) + M„(j) > 2d,M„(z) > d,M„(j) ^ d) 

= PmiKij) = d~ Mn{j)\Mn)l{Mn{i) + Mn{j) > 2d, M„(z) > d, M„ij) < d), 

where we have used that R^ij) > makes M„(j) > d impossible in the second equality. As 
Mn{i) + Mn{j) > 2d and Mn{j) < d imply that Mn{i) > d, letting p = l/(m — 1) we have, 
that 



anAhj) = PmiKU) = d- M„(j)|M„)l(M„(z) + M„(j) > 2d, M„(j) < d) 

= PmiKU) =d- M„(j)|M„(2))l(M„(^) + M„(j) > 2d, M^{j) < d) 
'Mn{i)-d' 



pd-M^j) _ ^)Af„W+Af„0)-2d i^Mnii) + M4j) > 2d, M4j) < d), (50) 



where we have used that M^t and are conditionally independent given Mnii), and there- 
fore that the conditional distribution of given M„ is the same as that given M„(?), 
specified in (fTT]) . 

Now considering bn,m{hj) in (138!) . using (fT3|) and arguing similarly we obtain 
bn,m{hj) = PUM'nij)=d\Mn)liM^it)<d,M^{j)^d) 

= PmiMnU) - RnU) = M„) 1 (M„ < d, M„(j) d) 
= PmiKU) = Mnij) - rf|M„)l(M„(z) < d, Mn{j) > d). 

By (fTT]) . Rn{j) < d — Mn{i) when Mn{i) < d, and therefore bn,mihj) = unless d — Mn{i) > 
Mn{j) — d, that is, unless Mn{i) + Mn{j) < 2d. Hence 



■"11,171 



[l,]) = PmiRnU) = M„(j) - rf|M„)l(M„(2) + M„(j) < 2d, Mnii) < d, Mnij) > d) 
= PmiRiU) = M„(j) - d|M„)l(M„(z) + M„(j) < 2d, M„(j) > d) 



(n-d\ (d-M„(i)\ 
\ d ) \M„ij)-d) 
(n-M„{i)\ 

^ Mn(j) > 



l{Mnil) + Mn{]) < 2d,Mn{]) > d), (51) 



using that the conditional distribution of -R.^(j) given M„, as Mnij) = ^nU) + -^n(i); 
hypergeometric. As n < 2d implies ("^'^) = 0, which implies bn,mihj) = 0, we assume 
n >2d when proving (1411) . 

Considering Cn,mihj) in fl39|) . let g = 1 — p and write 

Cn,Uhj) = PmiKij) + 0|M„)l(M„(j) = d) = i\- P™(K(j) = 0|M„))l(M„(j) = rf) 
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To prove each of fH0|l - fH2|) we apply the inequahty of lEfron and Steiru ( ll98ll ). Let 
Sn-i{xi, . . . , Xn^i) be a symmetric function of xi, . . . , Xn-i, and suppose that Ui, . . . ,Un 
are i.i.d. random variables. For k = 1, . . . ,n, let Sn,{k) be the value of Sn^i computed by 
omitting the k^^ variable Uk, that is, 

1 

Sn,{k) = Sn-l{Ui, . . . ,Uk-l,Uk+l, . . . ,Un), and Set Sn,(.) = — Sn,{k)- 

^ k=l 



Then by lEfron and SteinI (jl98ll . Equation 1.6) 



Var(S'„,(„)) < E'^{Sn,{k) - 5'„,(.))^ 



k=l 



As the average Sn,(.) minimizes the sum of squares, replacing it by any symmetric function 
of f/i, . . . ,f/„ yields 

n 

Vai{Sn,(n)) < E ^{Sn,{k) - Tnf = nE{Sn,(n) - ^r^)^ (53) 
k=l 

this last equality since the distribution of Sn,{k) — Tn does not depend on k. 

In order to apply (153!) . independently label the n balls 1 through n such that all orderings 
are equally likely, and let the variables Uk € {1, . . . iTn] denote the location of the k*^ ball, 
k = 1, . . . ,n. Note that the three functions (!50!) . (ISTl) and (!52|) can be written for all n as 
Tn = T{n, Mn{i), Mn{j)) for some function T. Hence, applying the Efron-Stein inequality 
with Sn-i{Ui, . . . ,Un-i) = T{n- 1, M„_i(i), M„_i(j)), we obtain Sn,{n) = ^^-i and 
yields 

Var(T„_i) < nEiTn-i - Tnf. (54) 
In particular, to prove fHOj) we apply fl5^ with 



Tn= ^ an,m{hj) and T„_i = ^ an,m,(n){hj) 

where an,m.,{n){,h j) is the value of an,m{hj) in ( 150|) when withholding ball n. As £(M^(j)|M„) = 
£(M;(j)|M„(i),M„(j)), we have that an,m,{n)(«, j) = an,m(«,j) whenever ^ and 
hence 

Tn—l Tn ^ ^ ['-^?i,m,(n) (f^nj j) '2n,m(^n) j)] ~l~ ^ ^ ['2n,m,(n) (^? f^n) '2n,m (^j f^n)] • 

l<j<rn,jjtUn l<i<m,i^Un 

(55) 

By (!50|) we can further restrict the summation of the first sum in (155!) over indices j in the 
union of the random index sets 

^1 = {1 < J < m, J ^ Un : Mn{Un) + M„(j) > 2c/ + 1, M„(j) < d} and 
^2 = {1 < J < m, J ^ : M„(f/„) + M„(j) = 2ci, M„(j) < d}. 
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For j G Ji 



(n){Un,j) - an,m{Un,j) 



\i-py 



MniUn) - d 
d - M„(j) 



2d 



V 



d-Mnij) 



(l-p) 



M„{U„)+A4n(j)-2d-l 



P 



MniUn) - d 



and for j G Ji this last term is bounded above in absolute value by 



P- 



d - Mnij) 



MniUn) - d 

and hence 



<P + 



max 



x+y>2d+l, x<d~l \y — d 



d — X 



<P + . <l + d/2 =: Ci9, 



id + 2)-d 



^n,m,{n)iUn, j) ~ (^n,miUn, j)\ ^ C 
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MniUn) -d\^,_j^^ 

d-Mnij) 



P 



(56) 



(57) 



To bound the right hand side we will use the fact that, for any x,k,£ G Nq satisfying 
0<x< k-i. 



k 

X + 



P^ < 



©, k<ii+l)/p + i 
2K for all k. 



i5l 



The second case is trivial, and to prove the first write 

k 



l<i<x 



ik-i-i+l)p ^ fik- 



l<i<x 



I + 



< 



+ 1 
+ 1 



using the restriction on k in the first case of (158|) . Then applying ( [58|) with £ = 1 and 
x = d- Mnij) - 1 to ([57D yields 

{n)iUn,j) - an,miUn,j)\ 

< Ci9p [(M„(f/„) - d)^l{MniUn) -d<2/p+l} + 2*^"(^")-^l{M„(f/„) -d>2/p+ 1}] 

< Cigp [MniUn) + 2*^"(^") l{M„([/„) > 2/p}] . 



This same upper bound holds for j G J2 as well since 



in)iUn,j) - an,miUn,j)\ 



MniUn) -d 

d-Mnij) 



pd~Mn(j)^l _ p-jMr,{Un)+M„{j)~^2d 



< 



MniUn) ~ d\ ._M„^^^ 
d-Mnij) r 
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and since Ciq > 1 we obtain 



Er, 



iUn,j)] } 

.i<j<m,j^U„ J 



<E^l Ci9p[M„(f/„) + 2^"(^")l{M„(f/„)>2/p}] 

ljeJiUJ2 

< {Cigmp [Mn{Un) + 2*^"(^") l{M„(t/„) > 2/p}] }' 

< 2{Cigmpy {EmMniUnY + E^ [4*^"(^") 1 {M„ (f/„) > 2M] } . (59) 



Using that Mn(Un) ~ i3(?7,, 1/m) we have 

EmMniUnf = {n/m){l - 1/m) + (n/m)^ < n/m + (n/m)^ < 2 



1 + 



(60) 



using (143|) . To bound the second expectation in (!59l) . we let -B„ ^ denote a random variable 
with distribution B{n,s), and note the identity 

£;[w^"-V(5„,.)] = {l-s + swYEf{Bn,sw/{i-s+s^n)) for all > and bounded / (61) 

and the bound 



P{Bn,s >t)< exp [-2n{t/n - sf] 



(62) 



of iHoeffdingI (1l963l ). Letting p = (4/m)/(l + 3/m) and applying (1611) with w = A and the 
bound (l62ll . we have 



[4^-v™i{5„,i/^ > 2/p}] = (1 + 3/m)"P(5„,p > 2/j9) < e^^/^exp 



exp 



3n 8 8p ^' 

^ H 2np 

m np'^ p 



< C20 exp 


"3n 8 









-2n p 

np 



<C2o, (63) 



where in the second-to-last step we used that p/p < 4 and —2np^ < 0, and the final step is 
as follows. The bounds in ( l3T]) imply that 



3n 



m Vm/ n ~ n ' 



and so 



3n 8 ^ 
m np^ ~ n 



l{m — 1)^ rn? 



n 



n n 



m 



< 



2m^ 



n n 



m 

<0, 

n 



implying that the entire term (163|) is bounded by C2o- Combining this bound with (160|) and 
dSlD yields 



.l<j<m,jj^Un 



(64) 
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Now consider the second sum in fl55l) . whose summation index can be further restricted 
to z G J3 U J4, where 

J3 = {l<i<n,t^Un: M„(i) + M„([/„) > 2rf + 1, M„(f/„) < 4 
J4 = {l<t<n,t^Un: Mnii) + MniUn) = 2d, M^iUn) < d}. 



For i G 



M„(i) - c/ 

d-Mn{Un) + 1 



M„(z) - d 

d-M„{i7„)^]__p^M„(i)+M„(C/„)-2d-l I p _ j-]^ _ 



d - Mn{Un) + 1 



M„(Z) + Mn{Un) - 2d 



(65) 



and, by an argument hke (j56|) . the difference above is bounded in absolute value by 

Applying (l58l) with £ = 2 and x = d — MniUn) — 1, we have that (l65l) is bounded in absolute 
value by 



C22P 



2 



l{Mn{i) - d < 3/p + 2} + 2^"(^)-'^l{M„(i) - > 3/p + 2} 



< C^22P [M„(«)2 + 2*^"«1{M„(0 > 3/j9}] < C22P [Mnitf + 2*^"«1{M„(0 > 2Ip}] . 

This same bound holds for i G J4 without the factor C22, since by ( l58l) with £ = 1 and the 
same 



Mn{i) d \ d~^U{U,^)/^ _ \Mnii)+M„iUn)- 

d-M^{U^))^ ^ 



2d 



< 



Hence, as C22 > 1 



-"m.(^!!))^''''"^''"^ - ^ ^^"-^'^ + 2*^"«l{M„(z) > 2/p}] 

< p [M„(2)2 + 2*^"«1{M„(«) > 2/p}] . 



Un)] \ <eAy, C^22p[M„(z)2 + 2^^"«1{M„(^) >2/p}] 
J UeJ3UJ4 



< <^ E^22P[M„(2)2 ^ 2*^"«1{M„(^) > 2/p}] 



1=1 



< {C22mpyEm[M4iy + 2*^"(i)l{M„(l) > 2/p}]^ 

< 2{C22mpf {E^Mn{lY + [4*^"(i)l{M„(l) > 2/p}] } . (66) 
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Now, combining the bound 



E^M^{lY<C^,^,[l + {n/mY] 

obtained from fj44|) with g = 4 and p = 1/m with the bound fj63|) . we have that fl66l) can not 
exceed C23[l + (n/m)^]. Applying this bound together with f lM|) in f l5^ yields 



Var„(T„_i) < nErr,{Tn-i - T^f < C24n 
using in the final inequality. Hence 



m/ \m/ 



n \ 



l+(-) 

m 



Var™ ^ = Var„(T„) < 2^24(^ + 1) 



1 + 



n + 1 



m 







1 + (-)' 







by taking Cg = 2^6*24 and using the elementary bound 



(n+1) 



n + 1 



m 



n + 1 



n 



n 



1 + 



n + 1 



n 



n y 



m 



< 2n 



< 2^+^n 



l+(- 
m/ 



for all j > 1, (67) 



thus proving fj40l) . 

To prove f HTj) we let T„ = J2i^j^ri,m{'i,j) and proceed similarly. In view of (15T]) . 
bn,m{hj) = when Mn{i) + Mn{j) >2d+l, but since bn,m,(n){h j) is calculated when with- 
holding ball n, bn,m,{n){h j) may be nonzero when M„(i) + Mn{j) = 2d + 1 and f/„ G 
a case we thus allow for in our definition of K below. We have 

Tn-l —Tn= ^ {bn,m,in){Un, j) " &n,m(^n, j)] + ^ [&n,m,{n)(^, f4) — bn,m{h Un)] 

= ^[bn,rn,in)iUn,j)-bn,miUn,j)]+ [&n,m,(n)(^, ^n) " V-m(^, ^n)] (68) 

where 

^i = {j:(t^n,j)ei^} and = : (^, f/n) G iT} with 
K = {(z, j) : z 7^ J, M„(z) + M„(j) < 2d + 1, M„(j) > d}. 

For any {i,j)EKwe have 

d J \M„U)-d^ { d )\ n-2d I 



(n-Mn(i)\ 
V M„(j) 



/n-Af„(j)\ 



M„(j)\ {n - Mnil) - M„(j))2d-M„(0-M„(i) 

d J {n-Mn{i))d 



(d-M„(^))! 



< 



2d+l\ 77,2d-M„(j)-M„(j) ^1 ^2d-M„(i)-(d+l) 



(2rf-M„(2)-M„(j))! 

^d-M„(i)-l 



d J {n-d+ l)d-M^{i) 1 



G 



n 



25 



n V n — (i + 1 



d-M„{i) 



< 



G 



25 



(n-c/+l)"'-^^"« 
2d 



25- 



n \2d-d + l 



d-Mn{i) ^ r)dr< 

O25 _ r.d-Mn{i) ^ ^ *-^25 



< — • 2" 



< 



n 



(69) 
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using that on K we have Mn{j) > d + 1, and that n > 2d in the first inequahty on the last 
hne. 

Now considering bn,m,{n){Un, j) for {Un,j) G K, note that by ( ISTI) . n < 2d implies 
bn,m,in){Un,j) = 0, as ("~^~'^) = in this case. Hence we may assume n — 1 > 2d. When 
{Un,j) £ K when n balls are tossed, then when the nth ball f/„ 7^ j is omitted, {Un,j) € -ft' 
still. Hence in this case fl69|) applies both to hn^miPn,]) and bn,m,{n){Un,j), yielding 

- ^n.,m(f^n,j)| < K,m,{n){Un, j) +K,m{Un, j) < ^26/^ for all j G Ki. (70) 

For the sum in fl68|) over i E K2 we have bn,m{h Un) = if Mn{i) + Mn{j) = 2d + 1, and 
otherwise the bound fl69|) holds. To consider bn^m,{n){'i, Un), again assume that n— 1 > 2(i since 
bn,m,{n){h Un) = othcrwisc, as before. In addition, if M^iUn) = d+1 then removing ball n 
leaves d balls in cell Un-, in which case (i, Un) ^ -ft', and hence we assume Mn{Un) > d + 1. 
In this case, after removing ball n the pair {i, Un) remains in K, and ( l69l) applies. Thus, 

|&n,m,(n)(^, ^4) " &n,m(«, f4) | < &n,m,(n)(«, f4) + &n,m(^, f^n.) < (^27/^ for all i G -ft'2, 

and combining this bound with ( 1701) for use in ( l54l) yields 

Vaim{Tn-l) < nEm{Tn~l -Tnf 

[Un,]) - &n,m(t^n, j)] + X] [^n,m,(n){i, Un) - bn,m{i, Un)] 

< nEm \ C2&/n + C27/n \ < nE^ {m(C26 + C27)/n}^ = Cg^ 
[jeKi i&K2 J 

by taking Cg = (Cag + C27)^ so 



2 

n 



2 2 

m ^ m 



Var^ ^fen,m(«, j) = Varm(T„) < Cg < Cg 



n + 1 n 



proving f HTj) . 

For fH2|) we recall expression ( 152|) wherein q = 1 — p, and let r„ = (^n,m{h j) ■ Since 
{i,i) = Cn,m{hj) as long as f/„ ^ {«, j}, we have 

Tn—l Tn ^ ^ \pn^m,(^n){Un, j) (^n,m{Un, i)\ ~l~ ^ ^ [Cn,m,(n) (^) ^n) ^n,in{j', f^n)]- 

Considering the first sum and casing out on whether MniUn) < d or M„(f/„) > d + 1, 

Cn,m,(n){Un,j) - C„,„(^„, j) = l{M„(j) = d} - ) 

= l{M„(j) = 4(M'^-*^"(^")l{M„(t/„) < 4 -pg^^"(^")-'^-4{M„(?7„) > rf + 1}) 
= p ■ l{M„(j) = 4(g'^-^^"(^")l{M„(f/„) < 4 - g*^"(^")-'^-4{M„(f/„) > d + 1}). 

Since the term in parentheses is bounded in absolute value by 1, we have that 

\Cn,m,{n)iUn, j) - Cn,m(t^n, j)| < l{^n(j) = d} , 
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whence 

( 5^[c„,™,(n)(f/n,J)-Cn,n^(t/n,J)] ) < Lf^ HM^^U) = d}] = Em{Y^) 

\jj^U„ J \ j=l J 

= p\al^ + ^ilj < (C28/m)2 (C3/i„,„ + /i^,^) < (C29/m)2 [C^m + w?) < C30, (71) 
using ( l28l) and the trivial bound /i„,m < "ni- Next, 

\Cn,m,(n){^,Un) - C„,™(^,?7„)| = (1 - gl*^"^-'^! ) 1 1{M„ (f/„) = d + 1} - l{M„(f/„) = 4| 

< 1 _ qWn{i)-d\ < ^^^1^-^ _ ^1 n^g^i _ 
Further, since p = l/(m — 1) < 1/2 by ( 132|) . by Taylor series 



|logg| = — log(l — p) <p + 



p 



2(1 - 1/2)2 



p + 2p^ < C^i/m, 



giving 



Erai '^[Cn,m,(n){hUn) - Cn,m{hUn)\ \ < ^ |M„(z) - (i| |log g| 



.2=1 



< 



(C3i/m)2m^ E„(M„(2) - df = ClEm{Mn{l) - df 



i=l 



Cl[VaTm{Mnil)) + (EmMnil) " df] < Cl,[VaTmiMnil)) + 2(E„M„(1))2 + 2rf2] 

= C|i[(n/m)(l - 1/m) + 2(n/m)2 + 2^^] < Cs2[l + (n/m)^] (72) 



by (|43D. Applying ([n]) and ([72]) in (|5ll), we have 

Var„(T„„i) < nEm{Tn^i -T^f < C 3-^12 



, n\' 

l+(- 
m/ 



so 



Var„ [ ^Cn,m{hj) ) = Var„,(T„) < C^sin + 1) 



1 + 



n + 1 



m/ 



by taking Cio = 8C33 and using ( 1671) . This proves (H2l) and thus concludes the proof of the 
lemma. □ 



4 Discussion 



Theorem 13.11 is applied in iGoldsteinI ( 120121 ) to obtain bounds on the normal approximation 
for the number of vertices in the Erdos-Renyi random graph of a given degree. Although the 
graph degree and occupancy problems have some features in common, they also differ in a 
number of significant ways. On balance, the occupancy problem is the more difficult of the 
two for the following reasons. 
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Table 1: Asymptotic domains, wherein Td and ifd are given by (1241) and f{n,m) ~ g{n,m) 
denotes f{n,m) = (1 + o{l))g{n,m). That hma^^Z/i^.m is strictly positive in the central 
domain follows from Lemma 13.11 Part [31 



Definition 






Asymptotic Distribution 
of Yn under 


n/m — )■ 
Ain,m /i e (0, oo) 


Left 

—J- 


-hand domain 
^ 1 


Poi(/i) 


n/m — !■ 


Left-hand 

~ mTd{n/m) 

— !■ OO 


intermediate doma 
^ 1 


m 


n/m ^ p & (0, oo) 


Central domain 
~ mTd{p) 1 v^d(p) G (0, 1) 




n/m — )■ oo 

f^n.m ^ Od 


Right-hand 

~ mTd{n/m) 

— OO 


intermediate domj 
^ 1 


lin 


n/m — i- oo 

fJ'n,m ^ /i e (0, oo) 


Righ 

—7- /i 


>hand domain 
1 


Poi(/i) 



First, the term "^nm required by Condition [T] of Theorem 13 -H the variance of the condi- 
tional expectation of the difference Y^ — Yn, is harder to compute for the occupancy problem. 
In particular, in the graph degree problem one can make a direct bound on this term, but 
here we appear to be forced to rely instead on the use of the Efron-Stein inequality. 

Another significant difference between these two problems is that for graph degree the 
removal of a vertex leaves the connectivity of the remaining graph unaffected, while the 
parameters of the occupancy problem that results after the removal of an urn depends on 
the number of balls that urn contained. In particular, even if the removed vertex in the 
graph degree problem was connected to all other vertices the reduced graph remains non- 
trivial, in contrast to the 'parallel' situation of removing an urn which contains all balls in 
the occupancy problem. As a result, though the graph degree problem is indexed by the 
number of vertices, and the variable of interest is a count over those same vertices, here we 
index by the number of balls, while the count is a sum over urns. The choice is driven by the 
fact that Condition [3] is concerned only with reduced problems of sizes n — L„ that satisfy 
Ln < Sn- And in the occupancy problem, limiting the number of urns that are removed when 
forming the subproblem does not guarantee that the reduced problem will be non-trivial, 
but limiting the number of balls removed does. 

A third important difference is that in the graph degree problem, we consider a graph 
with n vertices and connectivity 6/{n — 1), and the reduced problem is on the graph with 
one vertex removed. There, choosing the parameter space to be 6^ = (0,6] H (0,n — 1) for 
some large b yields that 9 E On implies ipnfi £ ©n-i where Vn,^ = (n — 2)9/ {n — 1), as 
required by Condition |3] of Theorem 13.11 That each parameter space 0^ is a subset of the 
same bounded interval (0, b] simplifies a number of the computations and bounds. For the 
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occupancy problem we have taken G„ to be unbounded for the following reason. When an 
empty cell is removed to form the reduced problem, the differences n/{m — 1) — n/m of the 
ratio of balls to urns equals n/(m(m — 1)). In the central domain this ratio behaves like 1/m, 
summing to the divergent harmonic series. On the other hand, though we appear forced to 
deal with the case where 9 is unbounded, here we obtain results in asymptotic domains in 
addition to the central one. 



Although we state our main result. Theorem II. as a uniform bound holding for all 
n > d and m > 2, th e occupancy p r oblem has been classically studied asymptotically as 



n, m — )■ cxD, such as by iKolchin et al.l (1l978l ). who define the five asymptotic domains given 



in Table [H which together give an essentially complete asyra p totic picture of the n,m ^ oo 



asymptotic with n/m varying from to oo. iKolchin et al.l (Il978l Theorem 4, p. 68) also 



show that, in the uniform occupancy problem, has limiting normal distribution in exactly 
the domains in which an,m oo, i.e., in the left-hand intermediate, central, and right- 
hand intermediate domains. Except for a small portion of the latter, our Berry-Esseen type 
bound in Theorem 11.11 provides convergence to the normal in these domains as well: The 
left-hand intermediate and central domains are covered by Corollary 11.11 and the right-hand 
intermediate domain is addressed in the following. 

Corollary 4.1 Let 

Sn,m = logm + dloglogm — n/m. 
If n,m ^ (yo in such a way that n/m ^ oo, fin,m oo, and 

lini ( ] > 6, (73) 



n,m^oo \^ log logm 

then rn,m — > oo and, in particular, 

sup \P {Wn,m <Z)- P{Z < 2)1 ^ 0. 

Proof: If fl73l) holds then there is e > such that, for all n, m sufficiently large, 

"-™ > 6 + £, or equivalently n/m < logm + {d — Q — e) log logm. (74) 



log log m 

We will use below that log[x(l + o(l))] = logx + o(l). Using Tabled! 
1 2 1 / ^n,m 1 , , mTd{n/m) 



min I Tfi)^e ^/'^ 

log I — - — r^— \ + o(l) = logm + (c? — 6) log(?2/m) — n/m — log((i!) + o(l) 



[l + (n/m)^]2j [ [n/mY 

n/m^dl 

= logm + {d — 6) log(n/m) — n/m + 0(1). (75) 

Noting that x ^ {d — 6) logx — x is decreasing for x > {d — 6)^, for n, m large enough so 
that n/m > {d — 6)^ and (!74|) holds, by ( !75|) we have 

log ^ > log m+ (d — 6) log [log m + (d — 6 — e) log log m] 

— [log m + {d — 6 — e) log log m] + 0(1) 
= {d — Q) log[(l + o(l)) logm] — {d — Q — e) log logm + 0(1) 
= {d — 6) log logm + o(l) — {d — 6 — e) log logm + 0(1) = e log logm -|- 0(1) — )■ oo. 
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An example of a regime satisfying the hypothesis of Corollary 14.11 is 



n 



[m (log m + {d — a) log log m)J , a > 6. 



□ 



(76) 



Then \ogfin,m = aloglogm + 0(1) -> oo by (j3]) and fl76|) . and 5„^m/(loglogm) = a, so (!73|l 
is satisfied. 

Although (!73|) does not cover all of the right-hand intermediate domain, the missing 
part is small s ince i t follows from ra/m — )• oo and fin,m — ^ oo that 6n,m — ^ C)0 (see, e.g., 
Kolchin et al.l (1l978l . p. 41)). Since ( I73ll implies that 6n,m = a(loglogm)(l + o(l)) for some 
a > 6, the only portion of the right-hand intermediate domain in which r„ „j -/^ oo but Yn still 
converges to normal is the narrow asymptotic where 6n,m but 6n,m < 6(loglogm)(l + 

o(l)). We note, however, that the limiting factor in r„ m that determines our bound arises 
when bounding \E'^, a term that also appears when computing a bound to the normal in the 
weaker Wasserstein metric using size bias couplings, such as th e bound obtained by applying 
Corollary 2.2 and Construction 3A of I Chen and RoUinl (120101 ). Hence this behavior appears 
to be unrelated to any aspect of our method that pertains to bounding the Kolmogorov 
distance, and the inductive method in particular. 

Lastly we remark that extensions of the present work to the case where the cell proba- 
bilities are non- uniform is of additional interest, and may likely also be approached with the 
use of Theorem 13.11 



Appendix 

Proof of Lemma 13.11 

Part [D By (1331) . it suffices to show (1271) . as then for any ri > the set on the right hand 
side of (|33|) is finite. By ([3]) and d > 2 we have 

fn\ 1 f ly-'' fn\^ 1 
hm sup iin,m = hm sup 1 < lim sup = 0, 

and similarly 

J ri \ 1 f 2Y-^^ f n \^ 1 

hm sup mim — 1) , , „ , — tt; 1 < , , , hm sup — = 0. 

m^J ^ \d,d,n-2dj m^<^ \ m -\d,d,n-2dj m^^c m^('^-^) 



Hence (|27D holds by dH). 

Part [2al As the mean fin,m is positive over the range of n and m considered, we equiva- 
lently show that the ratio 



'-'n,m _ -, \d,d,n~2dJ V mJ / m — i 



n—d \ jj^d 



is bounded. For d<n< 2d or m = 2 the result is clear, as 

1 f^n,m ^ !• 
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For n > 2d and m > 3, we obtain 



_ _ id,d,n-2d) ~ m) / m - l \ _ 1 A _ ]_\ 



n—d 



n~d 



1 



dim' 



d-i 



m 



-d 



d\{n)dm'^ \ m — 1 



m — 1 

n—d 



n—d 



in)d 1 - 
(i! m'^^^ \ m 

n—d 



n—d 



1 - 



m / \ m — 1 



n—d 



1-1 

m 



i—d^ 



(77) 



For the first term in (177j) . expanding (1 — x) around zero and using d > 3 yields 



2 

1 

m 



-d 



IH h -Rim with < - ^ ^ 

m 



and similarly expanding (1 — x)" around 1/m yields 

\ n—d / \ n—d 

m — 1 / V m 



R 



2,m 



with 



R2.m\ < 



n — d 



n—d— I 



< 



n 



-(n—d—l)/m 



m{m — 1) \ m J m{m — 1) 

Hence, we may bound the first term in (1771) by n'^/dlm'^'^ times 



2d 

1 + — + Ri, 
m 



n—d 

1 ) + R2,m 

m 



1 

m 



i—d 



2d\ 2d ( 1 

1 + — R2,m + — 1 - - 

m m \ m 



n—d 



+ Ri, 



n—d 



1 + R2,m 

m 



-n/m 



< 



< 



m 

^346-"/^'" 

m 



W m J m — I m \ 



^g(d+l)/m 

m(m — 1) 



1 + 



where we have used bounds such as e'^''™' < e'^^'^ for m > 3, and where C34 is a constant 
depending only on d. Hence the first term in (|7711 can be no greater than 

CM^-n/m f ^) + -) < % sup e-x\l + x) 

a! Vm/ V m/ ct! a.>o 
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for all n > 2d, m > 3. 

The next term in fl77|) is also bounded, as 



)2d (l - ^) 



d\{n)dm'^ 
as are the final terms, in view of 



1 - 



1 



m 



n—d 



< a 



35 



n 



m 



-n/m 




m 



n—d 



1 

m 



nj I Km/ 



Part 



-n/rrii 



n/m) 



d—ld/m 



n 



n/m 



d\ 



< — sup e "x 
d\ x>o 



x^d—l 



SO taking this to be C4 suffices to show the first claim. The second now follows from Part [2al 
Part [3j By differentiating and factoring we have 

d\ ■ ^'^{x) = x'^-^e-'^ix^ - 3dx^ + d{3d - l)x - d\d - 1)] 
= x'^-2e-^[x -{d- Vd)][x -d][x-{d + Vd)], 

and by considering the sign of this derivative we see that infj.>o V^d(a^) = mimpaid ± y/d), 
which we now show is positive. Letting y denote either positive value d ± y/d, note that 



yd+i yd-i(^^_i_yy 



y^'\d-l-yf {d±Vdf 



d\ 



{d-l)\ 



d{d — 1 — yy 



{d-iy. d{-lTVdy 
yd-i^d-l-yf 



{d-iy 



■ 1. 



(78) 



Noting also that X]^=o^ ^y^'{d' — yY/d'\ = y hy considering the variance of a Poisson 
random variable with mean y, we have 

^d{y) = 1 - Td{y) - Td{y){y - df/y 
1 



= 1 
= 1 

> 1 
= 0. 



y 



e-yyd+^ ^ e-yy'^iy - df 



d\ 



d\ 



1 fe~yy'^-\d ~l-yY ^ e-yy^{d - yf 



y 



E 



{d-iy 

e-yy^'{d' -yf 



d\ 



(by m) 



d'\ 



Part Hal If the claim fails then there are n*, m* < 00 and sequences rj — )■ 00 and {nj,mj) 
such that, for each j, a"^. > rj but nj < n* or mj < m*. As at least one of the previous 
two inequalities must hold for infinitely many j, by considering subsequences we may assume 
that 

nj < n* for all j, or mj < m* for all j. (79) 
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If the first case of fl79l) holds we have 



oo = hm rj = hm al = hminf al < hminf ( max al ) . (80) 



If it were that m := supj rrij = oo, then there would be a subsequence jfc — )■ oo on which 
, hence (x^ • 

finite set, 



rrij^ — )■ oo, hence cr^ — )■ for each fixed n by Part [H and by taking the maximum over a 



= hm maxa^ > liminf ( niaxcx^ ) , (81) 

A; — ^oo n<n* J— ^oo \n<n* ■'J 

this inequality because the lim inf of a sequence is always at least as small as the limit of a 
subsequence. (|8T|) would be a contradiction of flHOj) . leaving only the possibility that m < oo, 
which also leads to a contradiction since, using the second equality in (1801) . 



= < supa„^ < sup < oo, 

J J n<n* , m<m 



as the last supremum is taken over a finite set. If the second case of fl79p holds then for j 
large enough so that rj > (m*)^, since Yn < m under we have 

(m*)2 < r, < a^^,,^^^ = Var„^(r„J < i?™,(K4) < m^^ < (m*)^ 

again a contradiction. 

Part Hbl If the claim fails then there is £ > and sequences Vj — )■ oo and {nj,mj) such 
that, for each j, cr'^-^mj — '"'j but nj/rrij > {1 + e) logrrij. By PartHaJ taking subsequences if 
necessary, we can assume that nj,mj — )■ oo, and that for all j 

fij/mj > (1 + e) logmj > d. (82) 

Then, using Part |2a] we obtain that for all j sufficiently large 



< rrij j exp[— (nj — d)/mj] < rrij 
Using f l82|) and that the function x ^ x'^e~^ is decreasing for x > d, we have 



^ < mj e-"^/"^^e'^ < rrij ((1 + e) lognijY mj^^^'^e'^ < mj' ((1 + 5) logm,)'^e^ 



giving the contradiction 



00 = hm rj < C^{1 + eYe'^ lim 



m- ^ (log rrij 



0. 



J— )-oo ]^oo 

Part He) Set e = 1/4 and, using f l26|) and Parts HH and l4b| choose ri > large enough so 
that any n, m satisfying ^ > ri also satisfy 

cr^,,m ^ mTd{n/m)(fd{n/m){l — e) and e"'^'^" > 1 — 

Then, using f l25|l . 

^ > ^ > Mn m){l - 2e) > (1/2) inf ^,(x), 

so taking Cq to be this last suffices. □ 
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